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I.  Introduction 


The  class  of  descriptor  systems  has  been  the  subject  of  numerous  studies 
in  recent  years  (see,  for  example  [1-9,20,21,23,24,26,27]).  The  fundamental 
property  with  which  all  of  these  studies  have  had  to  deal,  in  some  form  or 
another,  is  the  fact  that  the  system  function  matrix  for  a  descriptor  system 
need  not  be  proper,  leading  to  impulsive  behavior  in  continuous  time  and 
giving  rise  to  noncausal  responses  in  discrete  time.  The  noncausality  of 
these  models  makes  them  a  natural  choice  for  modeling  spatially,  (rather  than 
temporally)  varying  phenomena.  Indeed,  if  one  considers  generalizations  of 
descriptor  models  to  more  than  one  independent  variable,  one  finds  that  these 
models  arise  in  many  contexts  such  as  in  describing  random  fields, 
electromagnetic  problems,  gravitational  anomalies,  etc. 

In  the  context  just  described  it  is  natural  to  consider  descriptor  models 
together  with  boundary  conditions.  While  it  has  been  recognized  in  the 
literature  that  discrete-time  descriptor  models  are  often  not  well-posed  when 
initial  conditions  are  specified,  the  implications  of  using  general  boundary 
conditions  have  not  been  investigated  for  these  systems.  This  paper  presents 
the  initial  steps  in  such  an  investigation. 

There  have  been  two  principal  stimuli  for  our  work.  The  first  is  the 
work  of  Krener  [12-14]  who  developed  a  system  theory  for  standard  (i.e.  not 
descriptor)  continuous- time  linear  systems  with  boundary  conditions.  (See  also 
the  related  work  in  [15.16]).  Krener ’s  results  expose  the  richness  of 
boundary  value  models  and  a  number  of  important  concepts  such  as  new  notions 
of  recursion  that  are  more  natural  for  such  systems.  The  development  in  this 
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paper  parallels  Krener’s,  with  some  important  differences  required  to  deal 
with  the  possible  singularity  of  the  system  matrices  involved. 

The  second  stimulus  for  the  study  presented  here  has  come  from  our  work 
on  estimation  for  noncausal  process  [10,11,22],  In  particular  in  [22]  we  have 
examined  the  estimation  problem  for  boundary-value  descriptor  systems.  In 
addition  to  producing,  among  other  things,  both  algorithms  and  new  types  of 
generalized  Riccati  equations,  this  study  also  produced  a  number  of  questions. 
Is  the  optimal  estimator  stable  and  how  is  stability  related  to  reachability 
and  observability?  Do  reachability  and  observability  guarantee  existence  and 
uniqueness  of  positive  definite  solutions  to  the  generalized  Riccati 
equations?  Stepping  back  we  see  that  there  are  more  fundamental  questions. 
What  do  reachability  and  observability  mean  for  boundary-value  descriptor 
systems?  What  does  stability  mean  for  a  boundary-value  process  defined  on  a 
bounded  interval?  In  this  and  in  subsequent  papers  we  provide  answers  to 
these  questions. 

In  the  next  section  we  introduce  the  class  of  two-point  boundary-value 
descriptor  systems  and  investigate  their  wel 1-posedness .  This  leads  us  to  the 
introduction  of  a  normalized  form  for  these  systems.  This  form  not  only 
normalizes  the  boundary  conditions  in  a  manner  analogous  to  that  of  Krener  but 
it  also  brings  the  system  matices  into  a  form  that  leads  to  statements  of  a 
generalized  Cay ley-Hami l ton  theorem  and  of  reachability  and  observability 
conditions  that  are  significantly  simpler  than  ones  found  in  the  literature. 

In  Section  III  we  introduce  the  two  notions  of  recursion,  namely  inward  from 
and  outward  towards  the  boundary,  that  were  first  used  by  Krener,  and  we 
investigate  the  processes  associated  with  each.  These  provide  the  basis  for 


defining  two  concepts«-of  reachability  and  of  observability  which  are  then 
examined  in  detail  in  the  following  two  sections. 

Finally  in  Section  VI  we  discuss  the  efficient  solution  of  boundary-value 
descriptor  equations  and  then  close  with  a  brief  discussion  in  Section  VII. 

II.  Wei 1-Posedness  and  Normalized  Form 

The  two-point  boundary-value  descriptor  system  (TPBVDS)  considered  in 
this  paper  satisfies  the  difference  equation 

Ex(k+1)  =  Ax(k)  +  Bu(k) ,  k  =  0 . N-l  (2.1) 

with  the  two-point  boundary  condition 

V.x(O)  +  Vfx(N)  =  v  (2.2) 

and  output 

y(k)  =  Cx(k) .  k  =  0 . N  (2.3) 

Here  x  and  v  are  n-dimensional ,  while  u  and  y  are  m-  and  p-dimensional . 
respectively. 

As  in  [2],  we  can  rewrite  (2.1),  (2.2)  as  a  single  set  of  equations 

Jx  =  M  (2.  la) 


where 


28  =  diag  (B . B,  I)  (2.5b) 

We  see  from  this  immediately  that  the  wel 1-posedness  of  (2.1),  (2.2)  —  i.e. 

the  existence  of  a  unique  solution  x(k).  k  =  0,  1 . N,  for  any  choice  of  v 

and  u(k) .  k  =  0.  1 . N-l  — is  equivalent  to  the  invert ibi 1 i ty  of  /.  Note 

that  the  inver tibi 1 i ty  of  'J  implies  that  the  submatrix  consisting  of  all  but 
its  last  block  of  rows  has  full  row  rank.  This  in  turn  implies  that  a 
necessary  condition  for  wel 1-posedness  is  that  (E.A)  comprise  a  regular  pencil 
[17],  i.e.  that  o£  ♦  /3A  is  invertible  for  some  and  therefore  for  "most”  a  and 
P-  Consequently  throughout  this  paper  we  assume  that  this  is  the  case. 

An  important  aspect  of  regular  pencils  is  that  they  can  be  transformed 
into  a  form  that  greatly  simplifies  the  answering  of  numbers  of  questions. 


Definition  2.1:  A  regular  pencil  (E.A)  is  in  s tandard  form  if  for  some  i  uid 

P 

oE  +  p\  =  [  (2.5) 


Note  that  any  standard  linear  system  (with  E  =  I)  is  in  standard  form 
(take  a  -  l ,  p  =  0).  Furthermore  any  well-posed  TPBVDS  can  be  transformed  to 
standard  form.  Specifically,  find  a  and  P  so  that  |a£  +  p\ \  /  0  and 
premultiply  (2.1)  by  (aE  +  PA)  This  does  not  change  the  system  or  the 


"state"  variable  x,  but  the  new  E  and  A  matrices  now  satisfy  (2.6).  It  is 
worth  noting  that  one  can  also  deduce  that  any  regular  pencil  can  be  put  into 
standard  form  by  examination  of  its  Kronecker  canonical  form  [17],  although 
that  construction  involves  a  similarity  transf ormation  on  x  as  well  (see 
Section  VI ) . 

A  pencil  in  standard  form  has  a  number  of  important  properties  a  few  of 
which  are  summarized  in 

Proposi tion  2.1:  Suppose  that  (E,A)  is  in  standard  form.  Then 

(1)  E  and  A  commute  and  thus  have  a  common  set  of  generalized 
eigenvectors  (which  we  refer  to  as  generalized  system 
eigenvectors) . 

(2)  The  pencil  {E^.A^}  is  regular  for  all  k  >  0. 

(3)  For  any  k ,L  >0,  there  exist  coefficients  . an-l  so  t^at 

n-1 

EkAl  -l  a, A"-1'1  E1  (2.Y) 

i=0 

Proof :  Suppose  without  loss  of  generality  that  a  ^  0  in  (2.6).  Then 
E  =  a  I  +  6A  where  t  =  1/a  and  6  =  -p/a.  The  commutativity  of  E  and  A  then 
follows  immediately.  The  remainder  of  (1)  follows  from  the  fact  that  E  and  A 
can  be  put  into  Jordan  form  by  the  same  similarity  transformation.  Indeed  tilt 
Jordan  blocks  must  be  of  commensurate  dimensions  (i.e.  no  block  of  E  or  A  can 
straddle  rows  of  several  blocks  of  the  other  without  extending  to  include  all 
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of  the  rows  of  those  blocks). 

Assume  then  that  E  and  A  are  in  Jordan  form.  Since  {E,A}  is  regular,  E 

and  A  cannot  have  a  zero  eigenvalue  associated  with  a  common  eigenvector. 

k  ( 

This  in  turn  implies  statement  (2).  Finally  to  prove  (3),  take  any  E  A  and 

replace  E  by  il  +  6A.  Then  apply  the  usual  Cayley-Hami 1  ton  theorem  to  all 

Ic 

powers  of  A  higher  than  n-1.  Finally,  multiply  each  A  in  the  resulting 

n — k —  1 

expression  by  I  =  (aE  +  (3A)  .  Expanding  yields  an  expression  of  the  form 

of  (2.7) 

Statement  (3),  which  states  that  {An  ^,EAn  ^,...En  *}  span  the  same 
k  i  i 

subspace  as  {A  E  |k.t  >  0},  is  a  generalization  of  the  Cay ley-Hami 1  ton 
theorem.  Note  that  this  statement  is  considerably  simpler  than  those  in  the 
literature  [6,8,28]  for  pencils  not  in  standard  form. 

Standard  form  also  provides  us  with  a  simpler  wel 1-posedness  condition: 

Theorem  2.1:  Suppose  that  {E.A}  is  in  standard  form.  Then  the  system  (2.1). 
(2.2)  is  well-posed  if  and  only  if 

N  N 

V.E*  +  VfA  (2.S) 

is  invertible. 

Proof :  One  method  for  deriving  this  result  is  to  apply  row  elimination  to 
solve  for  x(0)  and  x(N)  from  (2.4).  Methods  similar  to  this  will  be  used  in 
the  next  section  in  defining  inward  and  outward  processes.  In  this  proof  we 
use  a  different  method  that  provides  some  computations  we  can  use  immediately 
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For  example,  two  4x4  matrices  in  Jordan  form,  one  with  two  2x2  Jordan 
blocks  and  the  other  with  one  3x3  and  one  1  x  1  Jordan  block,  don't  commute 


To  begin,  iet  <j  be  any  number  such  that 


r  =  .en+1  -  an+1 


(2.9) 


is  invertible  (this  can  always  be  done  since  (En+^,  An+^}  is  regular).  Then 


we  can  express  'f  as 


where 


wi  th 


and 


*  =  *1*2 
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N  N-l 

sNN  =  (V1En  +  VfAn)r 


(2.10) 


(2.11) 


(2. 12a) 
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o 


Consequently  if  is  invertible  if  and  only  if  i/^is  invertible.  Examining 
(2.11),  (2.12)  we  see  that  this  is  the  case  if  and  only  if  the  matrix  in  (2.S) 
is  invertible. 

Definition  2.2:  The  system  (2.1),  (2.2)  is  in  normal ized  f orm  if  (E.A)  is  in 
standard  form  and  if 

N  N 

V.E  +  V.A  =  I  (2.15) 

if 

This  form  is  the  counterpart  of  Krener’s  standard  form  in  [12-H]  Note  that 
any  well-posed  system  can  be  put  in  normalized  form  by  left  multiplication  of 

(2.1)  and  (2.2).  Specifically  we  first  transform  {E.A}  to  standard  form  as 

described  previously,  to  obtain  new  E  and  A  matrices,  and  we  then  multiply 
N  N  -1 

(2.2)  by  (V.E  +  V^.A  )  to  obtain  new  V.  and  V^.  matrices  satisfying  (2.15). 
From  this  point  on  we  assume  that  (2.1),  (2.2)  is  in  standard  form. 

Next,  note  that  if  (2.8)  is  invertible,  the  inverse  of  has  the  same 
form  as  (2.11)  except  that  the  last  block  row  of  S ^  *  is 

l_c-1c  _s“ 

v  NN  NO’  NN  Ni .  NN  N.N-1’  NN; 

Using  the  expressions  for  i/^  ^  and  ^  we  can  then  write  down  the  Green's 
function  solution  of  (2.1),  (2.2): 


(2.16) 


N-l 

x(k)  =  AkE^-kv  +  J  G(k,t)Bu(l) 
1=0 

where 


,  Ak  [A-EN  k(V.A+wVfE)Ek]E6  kAN  4  V  1  .  6>k 

^  EN'k[(jE-Ak(V.A+(jVfE)AN_k]EtAk-<'_1r_1  ’  6<k 


(2.17) 


Here  G(k,l)  is  called  the  Green’s  function  cf  the  TPBVDS.  When  E  and  A  are 
both  invertible,  (2.17)  can  be  simplified. 


G(k, L) 


-a^V^a"-4-1  .  Qk 


AkEN'kViEtA_6_1 


(2.18) 


For  simplicity,  in  the  rest  of  the  paper  we  assume  that  T  is  invertible 

for  w  =  1  and  use  the  expression  (2.17)  for  G  with  u  set  equal  to  1.  This 

s  t 

assumption  is  equivalent  to  assuming  that  no  (N+l)  root  of  unity  is  an 
eigenmode  of  the  system  (where  a  is  an  eigenmode  if  |ctE-A|  =0).  All  of  the 
results  in  the  paper  have  obvious  extensions  to  the  case  of  an  arbitrary  value 
of  w,  as  we  simply  must  carry  <j  along  in  the  various  expressions. 


10 


III.  Inward  and  Outward  Processes 

One  of  Krener’s  most  important  observations  in  his  work  was  that 
boundary-value  systems  admit  two  notions  of  recursion,  namely  expanding  inward 
from  or  outward  toward  the  bounder ies.  In  this  section  we  introduce  the 
counterparts  to  these  notions  for  TPBVDS.  As  we  will  see,  the  possible 
singularity  of  both  E  and  A  leads  to  several  differences  in  our  context. 

Each  of  the  processes  associated  with  these  recursions  have 
interpretations  as  state  processes:  the  outward  process  summarized  all  that 
one  needs  to  know  about  the  input  inside  any  interval  in  order  to  determine  x 
outside  the  interval,  while  the  inward  process  simply  uses  input  values  near 
the  boundary  to  propagate  the  boundary  condition  inward.  In  Krener’s  context 
the  outward  process  represented  a  "jump’’,  i.e.  the  difference  between  x  at  one 
end  of  any  interval  and  the  value  predicted  for  x  at  that  point  given  x  at  the 
other  end  interval  and  assuming  zero  input  inside  the  interval.  In  our 
context  we  cannot  necessarily  predict  in  either  direction  (because  of  the 
possible  singularity  of  E  and  A)  and  therefore  must  use  a  slightly  modified 
definition  of  the  outward  process: 

z  (k.t)  =  EL~kx(L)  -  At_kx(k).  k<L  (3.1) 

Note  that  this  definition  agrees  with  Krener’s  if  E  =  I.  However,  in  general 
z  (k.l)  can  only  be  propagated  outward  whereas  in  Krener’s  case  the  outward 
process  could  be  propagated  inward  as  well.  An  explicit  expression  fur 
zQ(k,f)  in  terms  of  the  inputs  between  k  and  6  can  be  obtained  by 


premultiplying  (2.4)  by 


This  yields 


[0 . 0.  Ak  L  1  ,EAk~^~2 . Ek  1  ^0 . 0] 

L- 1 

z(k.<,)=^  Ej_kAt-j_1Bu(j) 
j=k 


Also,  we  have  the  recursive  relations 


ZQ(k-l , L)  =  EzQ(k ,t)  +  A6  kBu(k-l) 
zQ(k, L+l )  =  AzQ(k .1)  +  E6_kBu(6) 


(3.2) 


(3.3) 

(3.4) 


Furthermore,  as  in  [14]  it  is  straightforward  to  show  that  the  four-point 
boundary-value  system 


Ex(k+1)  =  Ax(k)  + 

Bu(k) 

(3.5) 

V.x(0)  +  Vfx(N) 

=  V 

(3.6) 

Kx(L)  -AL“Kx(K)  = 

zo(K,L) 

(3.7) 

has  the  same  solution  as  (2.1),  (2.2)  for  ke[0,N]  \  [K+l.L-1]  (i.e.  over  [0.K] 
and  [L.N]),  so  zq(K.  L)  does  indeed  summarize  ail  we  need  to  know  about  inputs 
between  K  and  L. 

The  inward  process  z^(k ,t)  can  also  be  defined  in  a  manner  analogous  to 
[14].  Unfortunately  in  the  present  context  z.(k,t)  is  a  complex  function  of 
the  boundary  matrices,  the  boundary  value  v,  and  the  inputs  u(j), 
je[0,N-l]  \  [k,l-l].  Specifically,  as  we  demonstrate  below,  for  k <i.  z.(k,f) 


has  the  form 


z.(k,l)  =  W.(k,l)x(k)  +  Wf(k ,l)x(l) 

=  Fkc[u(0).u(l).....u(k-I).u(t).u(t+1) . u(N-l).v]  (3.S) 

and,  in  addition 

z.(O.N)  =  v.  W.(O.N)  =  V.,  Wf(0,N)  =  V  (3.9) 

z.(k,k)  =  x(k)  =  F^ufO) . u(N-l),  v)  (3.10) 

where  the  F^  are  linear  functions  of  their  arguments.  Furthermore  the  TPBVDS 

Ex(k+1)  =  Ax(k)  +  Bu(k)  (3.11) 

W.(K,L)x(K)  +  Wf(K,L)x(L)  =  z.(K.L),  (3.12) 

has  the  same  solution  as  (2.1),  (2.2)  for  ke[K,L],  so  z.(K,L)  does  indeed 
represent  an  inwardly-propagated  boundary  condition  for  the  original  system. 

Let  us  first  indicate  how  (3.8)  -  (3.10)  can  be  computed  in  a  recursive 
manner.  The  basic  idea  here  is  to  eliminate  values  of  x  near  the  boundary 
from  (2.4)  in  order  to  obtain  a  reduced  set  of  equations.  The  resulting 
right-hand  side  will  then  involve  the  remaining  u’s  and  a  new  boundary 
condition  (see  (2.4c)).  Specifically,  suppose  we  wish  to  propagate  one  step 
in  from  the  left,  i.e.  to  compute  z  (l.N).  Note  that  for  'S  in  (2.5a)  to  be 
invertible  it  is  necessary  for 

-A 

V. 


to  have  full  column  rank.  Consequently  we  can  find  a  block  matrix  [T  P]  of 


full  row  rank  so  that 


[T  P] 


Premultiplying  (2.4)  by  the  matrix 

I  . 0  0 

0  . I  0 

0  . 0  P 


(3.13) 


(3.14) 


then  eliminates  x(0)  and  leaves  us  with  the  following  TPBVDS  on  [1,N]: 


Ex(k+1)  =  Ax(k)  +  Bu(k) 


(3.15) 


TEx(l)  +  PVfx(N)  =  Pv  +  TBu(O) 


(.3.16) 


It  is  easy  to  see  that  this  system  is  well-posed,  since  rank  (lif)  =  rank  (0) 
rank  ('./)  -  n  and  the  system  is  defined  over  an  interval  with  one  less  time 
step.  The  boundary  matrices  in  (3.16)  are  not  in  necessarily  in  normalized 
form,  so  we  then  need  to  premultiply  (3.16)  by 


yielding 


A  =  (TEN  +  PVfAN  l)  1 

W.(l.N)  =  ATE  .  Wf(l.N)  =  APVf 
FlN[u(0).v]  =  APv  +  ATBu(O) 


(3. it; 

(3. is; 
(3. 19; 


In  a  similar  fashion  we  can  move  the  right  boundary  inward,  in  this  case 
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premultiplying  (2.4)  by 


I 

0 

0 

0 


0  .  0 

I  .  0 

0 .  I 

0 .  S 


0 

0 

0 

Q 


(3.20) 


when  [S  Q]  is  a  full-rank  solution  of 


[S  Q] 


E 

V, 


=  0 


(3.21) 


It  is  also  possible  to  obtain  direct  rather  than  recursive  expression  for 
the  W’s  and  at  the  same  time  to  expose  the  relationship  between  the  inward  and 
tward  processes  that  we  will  use  in  Section  V.  Using  the  expression  (3.1) 


ou 


for  the  outward  process  zq  and  (2.4)  we  can  write 


-A 

0 

0 

V. 


0 


0 

0 


-A 

0 


0 


-\L~k  el  k  0 


N-f 


N-t 

♦ 

f 


x(0) 

x(k) 

= 

x(!) 

.  x(N). 

zo(0.k) 

zo(k.t) 

ZQ(<-.n) 


(3.22) 


As  we  did  earlier,  we  construct  a  full-rank  matrix 
[T  (k.l).  Tf (k. 1) ,  P(k, t)]  so  that 


[T.(k,t),  Tf(k.t).  P(k.t)] 
If  we  then  multiply  (3.22)  by 


-Ak  0 


N-t 
i  f 


0  E' 
V.  V 


=  0 


(3.23) 


we  obtain 


Q  ( k.i )  = 


0 

T.(k.t) 


I 

0 


0  0 
Tf(k,t)  P(k, l) 


(3.24) 


L-  k 


-A 


Ti(k.t)E 


CL- k 
L 

N -i 

-Tf(k.t)A  L 


x(k) 

u 

o 

N 

i _ 

x(f) 

Ti(k.f)zo(0.k)+Tf(k.f)zo(f ,n)+P(k.f )v 

(3.25) 


Equation  (3.25)  is  essentially  the  result  of  eliminating  all  variables  in 
(2.4)  other  than  x(k)  and  x(t)  by  propagating  outward  to  summarize  all  inputs 
between  k  and  i  and  inward  to  summarize  the  effect  of  the  boundary  condition 
and  inputs  from  0  to  k  and  i  to  N.  Consequently  we  can  identify  the  second 
block  of  equations  as  specifying  an  unnormalized  version  of  the  inward 
process.  Therefore  letting 


A(k. 6)  =  [Ti(k.t)Et  -  Tf(k,t)AN  k]  1  (3.26) 

we  have 

W.(k ,t)  =  A(k.t)T,(k.f)Ek  (3.27) 

Wf(k  ,L)  =  -A(k.t)Tf(k.<,)AN“<'  (3.2S) 

and 

z,(k.()  =  A(k.f)[T.(k./)zo(0.k)  ♦  Tf(k.t)zo(i.N)  +  P(k.f)v]  (3.26) 


In  the  case  of  standard  linear  systems  reachability  corresponds  to  the 
ability  to  drive  the  state  of  the  system  to  an  arbitrary  value  by  appropriate 

1G 


sfi 
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choice  of  the  input  sequence.  It  is  well  known  that  if  such  a  system  is 
reachable  it  is  possible  to  reach  an  arbitrary  state  value  by  proper  choice  of 
the  n  previous  input  values,  where  n  is  the  dimension  of  the  system.  In  the 
case  of  a  TPBVDS,  however,  there  is  a  distinction  between  the  concept  of 
reachability  by  choosing  the  inputs  in  an  n-point  neighborhood  and  the  concept 
of  reachability  by  choosing  the  inputs  in  the  whole  domain  of  definition  (i.e. 
[0,  N]).  The  first  concept  we  shall  refer  to  as  strong  reachability  and  the 
second  concept  as  weak  reachability.  These  concepts  correspond,  respectively, 
to  Krener’s  reachability  on  and  reachability  off  which  he  in  turn  defines  in 
terms  of  the  outward  and  inward  processes,  respectively.  We  shall  do  the  same 
in  the  next  two  sections  in  which  we  also  analyze  the  corresponding 
observability  concepts. 

IV.  Strong  Reachability  and  Observablity 

We  begin  with  an  examination  of  reachability,  and  for  this  we  need  the 
following 

Definition  4.1:  The  system  (2.1),  (2.2)  is  strongly  reachable  on  [K.I.]  it'  the 
map 

(u(k)  |ke[K,L]}  -  z^K.L) 

is  onto.  The  system  is  strongly  reachable  if  it  is  strongly  reachable  on  some 
interval . 


From  (3.2)  we  can  write 


(4.1) 


zo(K,L)  =  RS(L-K) 


u(K) 

u(L-l) 


where 


R  (j)  =  [Aj  2B\ .  .  .  !Ej  :B] 


(4.2) 


In  anticipation  of  the  following  result,  define  the  strong  reachability  matrix 


R  =  R  (n) 
s  s' 


(4.3) 


and  strongly  reachable  subspace 


as  =  Im(Rs) 


(4.4) 


Theorem  4.1:  The  following  statements  are  equivalent: 


1 

■> 

(a) 

(b) 

1 

(c) 

(d) 

desired  value  by  proper  choice  of  inputs  u(j).  jt[k-n, k+n-i ] .  and 


this  can  be  accomplished  for  any  choices  of  V.  and  for  which 


(2.1),  (2.2)  is  well-posed. 


Before  proving  this  result,  let  us  make  several  comments.  Note  first 
that  condition  (c)  is  one  of  the  reachability  conditions  found  in  the 
descriptor  literature  [7,26],  By  introducing  the  standard  form  of  a  regular 
pencil  we  are  able  to  obtain  a  condition,  namely  that  (4.2)  is  of  full  rank 
for  j  =  n.  that  is  far  simpler  than  those  presented  previously.  Note  also 
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that  as  for  standard  linear  systems,  condition  (6)  tells  us  that  a  system  is 
strongly  reachable  if  and  only  if  it  is  strongly  reachable  over  intervals  of 
length  n.  On  the  other  hand,  in  condition  (d)  we  require  that  x(k)  can  be 
driven  to  an  arbitrary  value  by  applying  appropriate  inputs  over  the  2n-point 
symmetric  neighborhood  of  k.  In  fact,  one  only  needs  an  n-point  neighborhood 
of  k,  but  the  extent  of  this  interval  before  and  after  k  depends  on  the 
matrices  E,  A,  and  B  (i.e.  on  the  causal /anti causal  structure  of  (2.1)) 
Condition  (d)  simply  uses  the  union  of  all  such  n-point  intervals  and 
therefore  is  approprate  for  al 1  TPBVDS.  Finally,  note  that  strong 
reachabilit  does  not  depend  on  the  boundary  matrices  V.and  V^.  (as  long  as 
(2.1),  (2.2)  is  well-posed).  This  can  be  seen  directly  from  the  definition  of 
ZQ(k ,L)  or  from  condition  (b) . 

Proof :  The  equivalence  of  (a)  and  (b)  follows  immediately  from  the 
generalized  Cay ley-Hami 1  ton  theorem  (statement  (3)  of  Proposition  2.1)).  As 
an  alternate  proof,  note  that 

Im[R  (k+1)]  =  ERs(k)  +  ARs(k)  (1.) 

so  that  Im[Rs(k+2)]  =  ImfR^fk+l)]  if  Im[R^( k-t- 1 ) ]  =  Im[R^(k)].  Also,  thanks  to 

(2.6) 

Itn[Rs(k)]  C  Im[Rs(k+l)]  (  1.6) 

Simple  dimension  counting  then  shows  that 
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Finally,  consider  the  equivalence  of  statements  (b)  and  (d).  Because  of 


the  linearity  of  the  system,  we  can  assume  that  v  =  0  and  u(j)  =  0  for 
je[0,k-n-l]  and  je[k+n,N].  In  this  case  (2.16),  (2.17),  and  (3.2)  allows  us 

to  write 

W  N-k  k  -1  N-k-n 

x(k)  =  AK[A-E  K(V.A+VfE)E  ]r  *  2Q(k.k+n) 

+  EN_k[E-Ak(V.A+VfE)AN"k]r_1Ek"nzo(k-n.k)  (1.12) 

Let  f  be  an  arbitrary  vector  and  choose  inputs  u(j).  jt[k-n,k-l]  so  that 
ZQ(k-n.k)  =  Enf  and  u(.j),  jt[k.k+n-l]  so  that  ZQ(k.k+n)  =  -Anf .  With  these 
choices  which  can  be  found  since  has  full  rank,  (4.12)  reduces  to 


choice  for  V.  and  V,.  is  in  normalized  form.  Let  us  take  v  =  0  and  u(  i)  =  O 
it 

for  ,jt[0,k-n-l]  and  ,jf[k  +  n.iN].  Then  in  ttiis  case  (2.10).  (2.  If)  reduces  to 


:(k)  =  A  An  4N  n  ^(k-n)  +  An  V  nBu(k-n-l)  +  . 

+  ENBu(k-l)  +  -rA^Bu(k)  +  -tAN_1EBu(  k+ 1 )  + 

r,n-l  . N-n+  In  , ,  ,  .  1 

+  -rE  A  Bu(k+n-l ) 


The  range  of  the  mapping  defined  in  (4.16)  is 


EN"n_1a  ♦  AN_n+1a  ' 
s  s 


Assuming  that  (d)  is  true,  this  must  also  be  all  of  IR  .  Consequently  we 

conclude  that  $  =  IRn  for  this  choice  of  V.,Vr.  Thanks,  then,  to  statement 
s  l  f 

(c)  of  the  theorem,  we  see  that  Jl  =  [Rn  for  any  V..Vr  for  which  the  TPBVDS  is 

s  if 

well-posed,  so  that  statement  (a)  also  must  hold. 


We  next  wish  to  consider  the  dual  concept  of  strong  observability.  To  do 
this  we  proceed  in  a  manner  analogous  to  that  for  casual  linear  systems. 
Specifically,  for  such  systems  observability  corresponds  to  being  able  to 
reconstruct  the  state  at  some  point  in  time,  given  present  and  future 
observations,  when  all  future  inputs  are  zero.  The  counterpart  to  this  in  our 
context  is  the  following. 


Def ini t ion  4.2:  The  system  (2.1)  -  (2.3)  is  strongly  observable  on  [K,L]  if 
the  map 


z.(K,L)  ->  (y(k)  |ke[K,L]} 


(4.  lV- 


defined  by  (3.11),  (3.12)  wi th  u  =  0  is  one-to-one.  The  system  is  s t  rung  1 « 
observable  if  it  is  strongly  observable  on  [K.L]  for  all  K.L  such  that 


L-K  >  n-1 
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Since  (3.11),  (3.12)  is  in  normalized  form,  we  can  adapt  the  Green’s 
function  solution  (2.16)  to  obtain  an  explicit  expression  for  the  mapping 


defined  in  (4.17).  Specifically 


where 


y(K) 

y(K+i) 

y(L) 


Os(L-K)z.(K.L) 


(4. IS) 


os(j) 


CEJ 

CAEj_1 

CAJ 


(4. 19) 


In  analogy  with  our  reachability  results,  we  define  the  strong  observability 
matrix 


°s=0s(n_1)  (4.20) 

and  the  strongly  unobservable  subspace 

Cs  =  ker(Os)  (4.21) 


Theorem  4.2:  The  following  statements  are  equivalent. 


(a)  The  system  (2.1)  -  (2.3)  is  strongly  observable. 

(b)  The  strong  observability  matrix  0g  has  full  rank. 

( c )  The  matrix 


sE  -  tA 
C 

has  full  rank  for  all  (s,t)  ^  (0,0). 

(d)  The  state  x  at  any  point  ke[n,N-n]  can  be  uniquely  determined 
from  the  outputs  y(j).  je[k-n,k+n-l]  and  u(j),  je[k-n,k+n-2] . 

This  can  be  accomplished  for  any  choice  of  V.  and  for  which 
(2.1).  (2.2)  is  well-posed. 

The  proof  of  this  theorem  is  analogous  to  that  for  Theorem  -1. 1  and 
therefore  is  omitted.  Also,  one  can  make  similar  comments  concerning  this 
result.  For  example,  thanks  to  the  generalized  Cay ley-Hami 1  ton  theorem, 
statement  (b)  is  considerably  simpler  than  expressions  that  have  appeared 
previously.  Also,  strong  observability  depends  only  on  E.  A.  and  C  and  not  on 
the  particular  choice  of  boundary  matrices  V.  and  V^. 

V.  Weak  Reachability  and  Observability 

As  Krener  noted,  in  contrast  to  strong  reachability  and  observability, 
the  concepts  of  weak  reachability  and  observability  depend  intimately  on  t  tie 
particular  choice  of  boundary  matrices,  as  the  structure  of  these  matrices  can 
increase  reachability  and  observability  beyond  that  which  might  be  apparent 
from  an  examination  of  system  dynamics  alone.  The  examination  of  these  weaker 
concepts  for  TPBVDS  is  somewhat  more  complicated  than  in  Krener 's  case  because 
of  the  possible  singularity  of  E  and  A. 
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Def ini tion  5.1:  The  system  (2.1),  (2.2)  is  weakly  reachable  off  [K.L]  if  the 

map  defined  in  (3.8),  with  v  =  0,  is  onto.  The  weakly  reachable  subspace 

®w(K,L)  is  the  range  of  this  map.  The  system  is  weakly  reachable  if  it  is 

weakly  reachable  off  [K,L]  (i.e.,  if  91  (K.L)  =  IRn)  for  all  K,Le[n,N-n]. 

w 

Note  that  the  weak  reachability  condition  is  a  natural  counterpart  to  the 
causal  reachability  definition  in  which  we  require  that  the  state  can  be 
driven  to  an  arbitrary  value  from  zero  initial  condition.  Also,  note  the  use 
of  the  wording  "reachable  off" ,  emphasizing  the  fact  that  the  inputs  used  in 
this  case  are  confined  to  the  exterior  of  the  interval  [K,L]. 

An  important  property  of  a  causal  system  is  that  the  dimension  of 
reachable  space  does  not  change,  and  in  fact  the  reachable  space  itself  is 
time-invar iant .  The  following  theorem  shows  that  the  first  of  these 
statements  is  also  true  for  TPBVDS's.  Example  5.1  later  in  this  section  shows 
that  the  second  is  not. 

Theorem  3.1:  The  dimension  of  J?w(K.L)  is  constant  for  K . Le [n , N-n] . 

Proof :  Let  K.L  be  any  points  in  [n,N-n].  From  (3.29)  (with  v  set  to  0)  we 

see  that 

yK.L.)  =  A(K.L)[T.(K.L).<s4-Tf(K.L).^]  (5.1) 

Now  assume  that  K-le[n,N-n]  as  well.  We  would  like  to  show  that 
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To  do  this,  we  first  must  find  T^(K-l.L)  and  T^(K-1,L).  In  fact,  what  we  show 
is  that  a  possible  set  of  choices  for  T  ,  T^.,  and  P  is 


T.(K-l.L)  =  T.(K,L)A 

(5.3a) 

Tf(K-l,L)  =  Tf(K.L) 

(5.3b) 

P(K-l.L)  =  P(K.L) 

(5.3c) 

where  A  has  the  same  eigenstructure  as  A  except  that  the  zero  eigenvalue  in  A 

~  3 

has  been  replaced  by  1  in  A.  Without  loss  of  generality  we  can  assume  that  A 
is  in  the  Jordan  form 


A  = 


J  0 
0  N 


(5.4) 


where  J  is  invertible  and  N  is  nilpotent.  In  this  case 


A  = 


J  0 

0  N+I 


(o.o) 


For  (5.3)  to  be  a  valid  choice,  two  conditions  must  be  satisfied.  First 
[T^(K-1  ,L)  ,Tj.(K-l  ,L) ,  P(K-1  ,L)]  must  be  of  full  rank.  This  is  obviously  the 
case  since  [T  (K.L) ,Tj.(K,L) , P(K,L) ]  is.  and  A  is  invertible.  Secondly  we  must 


Since  similarity  transformations  have  no  effect  on  the  dimension  of  the 
reachability  spaces. 


show  that  (3.23)  is  satisfied  with  k  =  K-l  and  L  =  L,  i.e.,  we  must  verify 


-T.(K.L)AAK  1  +  P(K.L)V.  =  0  (5.6) 

when  we  know  that 

-T.(K.L)AK  +  P(K.L)V.  =  0  (5.7) 

However,  since  K-l  >  n,  *  =  0.  so  that  AA^  *  =  A^. 

Consequently,  we  can  write 

9t  (K-l.L)  =  A(K-l,L)[T.(K.L)A2ft  +  T  (K.L)i*  ]  (5.S) 

W  1  S  l  S 

(Note  that  (4.25)  may  not  be  valid  if  K-l  <  n,  since  Sfts(K-l)  may  be  smaller 
than  ^s)  Comparing  (5.1)  and  (5.8)  and  using  the  fact  that  the  A(k,l)  are 
all  invertible,  we  see  that  (5.3)  will  hold  if  we  can  show  that 


(5.9) 


Note  first  that  A$s  so  that  (5.9)  is  clearly  true  if  A  is  invertible.  If 

A  is  singular,  note  that  a  cannot  be  zero  in  aE  +  /3A  =  I ,  so  that  9t ^  is  given 
by  (4.8).  Then  assuming  that  A  and  A  are  as  in  (5.4)  and  (5.5)  and  using  the 
fact  that  J  is  invertible,  we  see  that  (5.9)  will  hold  if  we  can  show  that 

0  0 

SJ  ca  (5.  10) 

AT  ^  S 


2 


If  we  partition  B  compatably  with  (5.4)  we  see  that 


B, 


a 

s 


Im 


B, 


J®! 

NBr 


■Jn  1B1 


.#  lB2  0 


I 

(5.11)  ! 

I 

I 

» 

I 


(5.12) 


where  p  is  the  nilpotency  degree  of  N.  Let  J  be  n^xn^  and  N  be  n^,xn9  (so  that 
nj+n£  =  n  and  p  <  n^).  Suppose  that  [f ^ we  wish  to  show  that 

[0.?2  ]’e^s-  However,  if  [f  i  •  ^2  ^  '  e^s '  t*iere  exisc  inputs  u..i  =  0 . p  -  1 

so  that 


4-1 

=  I  NVt 


(5.13) 


i=0 


We  then  wish  to  show  that  we  can  augment  this  sequence  with  u..i 


=  4 . n  so 


that 


i . e . ,  so  that 


n-1 

I  jVi  =  0 

i=0 

i=p  i=0 


(5. 14) 


(5. 15) 


The  right-hand  side  of  (5.15)  is  in  the  reachable  space  of  (J.B^). 

Furthermore,  since  n-l-4  >  n^-l,  the  left-hand  side  of  (5.15)  can  be  driven  to 


any  point  in  the  reachable  space  of  (J,B^). 

So  far  we  have  shown  that  ^(K-l.L)  has  the  same  dimension  as  ^(K.L)  as 
long  as  K-l  >  n.  In  a  similar  manner  we  can  show  that  ^(K,L+1)  has  the  same 
dimension  as  well,  as  long  as  L+l  <  N-n.  This  then  completes  the  proof  of  the 
theorem. 

Note  that  one  immediate  consequence  of  Theorem  5.1  is  the  following 

Coro  1 lary :  The  system  (2.1)  -  (2.2)  is  weakly  reachable  if  it  is  weakly 
reachable  off  some  [K.L]  with  K,L£[n,N-n], 

Hence,  in  order  to  test  for  weak  reachability  we  need  only  examine  the 
reachability  space  J1  (k.k)  of  z.(k,k)  =  x(k)  for  any  ke[n,N-n],  Note  further 

W  1 

that  at^fk.k)  is  the  range  space  for  the  map  from  (u(0) . u(N-l)}  to  x(k) 

(with  the  boundary  value  set  to  zero);  i.e.,  weak  reachability  corresponds  to 
being  able  to  drive  x(k)  to  an  arbitrary  value  using  the  entire  interval  of 
the  controls.  Thanks  to  statement  (d)  of  Theorem  4.1,  we  see  that  weak 
reachability  is  indeed  weaker  than  strong  reachability  which  corresponds  to 
being  able  to  drive  x(k)  to  an  arbitrary  value  using  only  inputs  within  n  time 
steps  of  k. 

While  (5.1)  provides  in  principal  a  method  for  computing  weakly  reachable 
subspaces,  it  involves  a  significant  amount  of  computation  in  order  to 
determine  A(K,L),  T^(K,L),  and  Tj.(K,L).  As  the  next  theorem  shows,  there  is 
an  easier  method  for  computing  ^(k.k). 


Theorem  5.2-  Let  ke  [n,N-n].  Then 


“*w(k,k)  =  Im[AkEN  k(V . A  +  VfE)Rs  !  R  ] 

=  Im[AkEN_kV.R  !AkEN"kVrR  !R  ] 

IS  f  s  s 

Proof •  From  (2.16),  (2.17)  (with  w  =  1  for  simplicity)  we  see  that 

$w(k.k)  =  Im[Ak(A-£N~k(ViA+VfE)Ek)R(N-k):EN_k(E-Ak(ViA+VfE)AN_k)R(k)] 

That  is,  if  wfc^(k.k).  then  there  exist  x,yefts  so  that 

w  =  Ak[A-EN_k{V.A+VfE)Ek]x  +  EN_k[E-Ak(V.A+VfE)AN~k]y 

=  (Ak+Ix  +  EN_k+1y)  ~  AkEN'k(ViA+VfE)[Ekx+AN"ky] 

Since  Jl  is  E-  and  A-  invariant,  we  see  that 
s 

\(k.k)  C  Im[AkEN_k(V.A+Vf.E)Rs:Rs] 

The  first  equality  in  (5.16)  will  be  proved  then  if  we  can  show  that 
the  range  of  [AkE^  k(V  A+V^.E)Rg ! Rg]  is  in  £  (k.k).  Clearly  any  such 
writ  ten  as 

k  N-k 

w  =  s  -  AT  (V.A+V  E)t 


(5.16) 


(5.17) 


(5. IS) 


(5.19) 

any  w  in 
w  can  be 

(5.20) 


( 5 . 26 ) 


Im[(V.A+VfE)Rs:Rs]  =  Im[ ( V.Rg J Vf Rg] 

Final ly 

Im[AkEN_k(V.A+VfE)Rs:Rs]  =  AkEN_kIm[(V. A+VfE)Rs !R  ]  + 

=  AkEN_kIm[V.R  !V.R  ]  +  A 
1  s  f  s  s 

=  Im[AkEN_kV.Rs:AkEN_kVfRs:Rs]  (5.27) 

Note  from  (5.16)  that  91^  C  iS^(k.k)  for  k  e  [n,N-n],  consitent  with  our 
earlier  statement  that  weak  reachability  is  indeed  a  weaker  condition. 

Examp  1 e  5.1:  Consider  the  system  (2.1).  (2.2)  with 


1 

0 

0 

1 

E  =  I.  A  = 

0 

0 

1 

.  B  = 

0 

0 

1 

0 

0 

1 

0 

0 

0 

0 

0 

0 

l 

0 

•  vf  = 

0 

0 

0 

l 

0 

1 

1 

0 

0 

This  system  is  in  normalized  form  for  all  N.  Since  E  =  I.  strong  reachability 

reduces  to  the  usual  notion  of  reachability  for  causal  systems.  Clearly  then 

T 

is  spanned  by  the  vector  [1,0.0]  .  From  (5.16)  we  find  that  ^(k.k)  is 

r  t  r  i' 

spanned  by  [1,0,0]  and  [0,0,1]  for  k  even  and  by  [1,0,0]  and  [0.1,0]  for  k 
odd.  This  example  illustrates  the  mechanism  through  which  some  states  mav  be 
weakly  but  not  strongly  reachable.  It  also  demonstrates  another  fact  peculiar 
to  boundary- value  systems:  while  the  dimension  of  ^(k.k)  remains  constant 
for  kt[n,N-n],  this  subspace  is  not  dynamical ly- invar iant .  In  particular. 


A  .It.) 


while  the  dynamics  (o.2S)  allow  the  input  to  influence  only  the  first 


component  of  x(k),  the  boundary  matrices  (5.30)  couple  the  first  and  third 


components,  allowing  indirect  control  of  the  third.  The  A-matrix  then 


produces  the  oscillatory  behavior  in  A^(k,k) 


Theorem  5.2  provides  us  with  a  computable  weak  reachability  condition- 


we  check  to  see  if  either  of  the  matrices  in  (5.16)  is  full  rank.  The 


following  result  provides  a  simpler  result  of  this  type  as  no  powers  of  E  or  \ 


must  be  computed. 


Theorem  5 . 3 :  The  system  (2.1), (2.2)  is  weakly  observable  if  and  only  if 


either  of  the  matrices 


[EA(V.A+VfE)Rs:Rs] 


(5.30a) 


[EAV.R  1EAV.R  !R  ] 
L  is  f  s  sJ 


(5.30b) 


has  full  rank . 


Proof :  We  begin  by  showing  that  for  any  subspace  SC  of  IRn 


ESC  +  A  =  IRn  <-»  E22)  +  A  =  IRn 

s  s 


(5-31) 


Let  ?  be  a  subspace  so  that 


.?  <B  3  =  ESC  +  A 

s  s 


<3  32) 


E  SC  +  A  =  E(ESC  +  A  )  +  A  =  E(9  Q  A  )  +  A 
s  s'  s  '  s'  s 


=  EJ  ©  A 


(5.33) 


Dimension  counting  then  shows  that  the  r igh t- to- lef t  implication  in  (  >  il  )  i 
true.  Suppose  that  E2!  +  :^s  =  (Rn.  Then 


E22!  ♦  =  E(E9  +  :*  )  ♦  S  =  E(IRn)  +  2  ESC  +  *s  =  IRn  M) 


1/  Ta 

Note  that  by  iterating  (5.31)  we  see  that  i  f  E  2)  +  31  =  (R  for  some  k  ■  O.  it 

equals  IRn  for  all  k  >  0.  A  similar  statement  can  be  made  with  E  replaced  by 

k  l  n 

A.  and  combining  these  we  have  that  EASD  +  dl  =  IR  for  some  pair  k.f  >  0  if 

s 

and  only  if  EA2)  +  £  =  IRn.  The  theorem  then  follows  from  the  application  of 


this  result  with  2)  =  Im{  (V^A+V  E)R  } 


Now  let  us  briefly  present  the  corresponding  concept  of  and  results  on 
weak  observability. 


Definition  5.2:  The  system  (2. 1)— (2.3)  is  weakly  observable  off  [K.I.]  if  the 
map  from  z^[K.L]  to  {y( j) | jfc[0.K]U[L,N]} ,  defined  by  (2.3)  and  the  four-po i n  t 
boundary-value  problem  (3.5)-(3.7)  with  v  =  0.  u  =  0.  is  one-to-one.  The 
weakly  unobservable  subspace  Cw(K,L)  is  the  kernel  of  this  map.  The  system  is 
weakly  observable  if  it  is  weakly  observable  off  [K.L]  (i.e.,  if 
0  { K.L)  =  (0})  for  all  K.Le[n-l ,N-n+l] . 


Theorem  5.4:  The  dimension  of  ©^(K.L)  is  constant  for  K.I.i  [n-1  ,  N-n+ 1  ] . 


Corollary:  The  system  (2.1)-(2.3)  is  weakly  observable  if  it  is  weak!; 


observable  off  some  [K.L], 


A  consequence  of  this  last  result  is  tliat  in  order  to  test  for  weak 
observability  we  need  only  examine  the  unobservabi 1 i ty  space  f  (k.k+1)  of 
Zo(k,k+l)  =  Bu(k).  Furthermore,  note  that  C^(k.k+1)  is  the  kernel  of  the 

mapping  from  Bu(k)  to  the  full  sequence  of  measurements  y(0) . y ( N )  (with  v 

set  to  zero).  This  is  weaker  than  strong  observability  which  involves  the  usi 
of  outputs  restricted  to  lie  within  n  time  steps  of  k. 

Theorem  5.5:  Let  ke[n,N-n].  Then 


Cw(k.k) 


ker 


C 

s 

Cs(V.A+V^)AN"k“1Ek 


0 

s 


=  ker 


_  v  .N-k-l^k 
0  V  .  A  L 

s  1 

N-k-1  u 
0  V.A  E 

s  f 


Note  that  kerC  (k.k)  C  ker<?  ,  demonstrating  again  that,  weak  observability  is 

w  s 

weaker  condition. 


Theorem  5.G:  The  system  (2.1)-(2.3)  is  weakly  observable  if  and  only  if 
either  of  the  matrices 


0 

s 

0  (V . A+V  E)AE 

s  i  i 


(a.  It  a) 


3T> 


-r'-r. 


.  v.  . 


i 


s 

! 

I 


*  I 

P: 

r, 

S 


i 


0 


0  V.AE 
s  1 


0  V.AE 
s  f 


(5.3Gb) 


has  full  rank. 


VI.  Efficient  Solution  of  TPBVDS’s 


Unlike  causal  systems,  the  solution  of  a  TPBVDS  cannot  be  computed  using 
a  simple  recursion  since  the  solution  x(k)  depends  on  inputs  over  the  entire 
interval.  There  are,  however,  several  efficient  methods  for  solution  which  we 
describe  in  this  section. 


6.1  The  Two-Filter  Solution 


In  his  study  Krener  derived  a  solution  by  solving  his  continuous-time 
linear  system  assuming  a  zero  initial  condition  and  then  correcting  for  t  tie 
actual  boundary  conditions.  Since  E  and  A  may  both  be  singular  for  a  TPBVDS. 
the  analogous  procedure,  first  described  in  [22],  is  somewhat  more  complex  as 
we  must  identify  which  parts  of  the  system  can  be  solved  in  the  forward  and 
backward  directions. 


From  Kronecker's  canonical  form  for  a  regular  pencil  [It]  we  can  find 

4 


nonsingular  matrices  T  and  F  so  that 


4Th. 


e  decomposition  in  [It]  splits  the  pencil  zE-A  into  forward  dynamics 


corresponding  to  a  pencil  of  the  form  zI-Aj  and  backward  dynamics 
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>■«.  i't  j*.  is 


(6. la) 


A  0 


0  I 


((5.  lb) 


so  that  all  of  the  eigenvalues  of  and  A2  have  magnitudes  no  larger  than  1. 


Def ine 


x^k) 


*2{k) 


=  Tx(k) 


(6.2) 


Then  we  obtain 


xL(k+l)  =  AjX^k)  +  BjuCk) 
x2(k)  =  A2x2(k+i)  -  B2u(k) 


(6.3a) 


(6.3b) 


where 


(6.1) 


Note  that  (6.3a),  (6.3b)  are  asymptotically  stable  recursions  if  |zE-.\|  has 


zeros  on  the  unit  circle.  Finally,  given  the  transformation  (6.2),  the 


boundary  condition  (2.2)  takes  the  form 


Xj(0) 


[V..IV-.]  mx  +  [v,r:v.K] 

L  li  2iJ  X„(0)  L  If  2fJ  x 


xl(N) 


(6.3) 


corresponding  to  z  -Ag  where  is  nilpotent.  The  only  difference  in  (6.1). 


(6.2)  is  that  the  unstable  forward  modes  of  A^  have  been  shifted  into  the 


backward  dynamics  A 2* 


'J 


Employing  the  forward/backward  representation  (6.3)  of  the  dynamics,  a 
general  solution  to  (2.1),  (2.2)  is  derived  as  follows.  Let  x^(k)  denote  the 
solution  to  (6.3a)  with  zero  initial  condition,  and  let  x2^(k)  denote  the 
solution  of  (6.3b)  with  zero  final  condition.  Then 


x^k)  =  A1kx(0)  +  x1°(k)  (6.7a) 

x2(k)  =  A2N  kx2(N)  +  x2°(k)  (6.7b) 


Substituting  (6.7)  into  (6.5)  and  solving  for  x^(0)  and  x2(K)  yields 


(0) 

x0(N) 


=  H  1{v-Vlfx1°(N)-V2.x20(0)} 


(6.9) 


where 


11  =  [Vli+VlNAlN:V2iA2N  *  V2f]  =  V.T'1(FET'1)N  +  Vf.T"1(FAT"1)N  (6. 0) 


Finally,  substituting  (6.8)  into  (6.7)  yields 


X1  (k) 

x0(k) 


A  k  0 

1  N-k 
0  A2Nk 


.-1 


v-VlfXl°(N)-V21x2°(0) 


0,,  . 

Xj  (k) 

x2°(k)  J 


(6.10) 


The  solution  in  the  original  basis  can  then  be  obtained  by  inverting  (6.2). 
Note  that  the  transformed  matrices  in  (6.1),  (6.2)  commute  and  are  in 


fact  in  a  form  close  to  our  normalized  form  (see  discussion  in  the  next 
section).  However,  the  full  importance  of  transforming  the  system  into 
normalized  form,  and  in  particular  its  implication  for  a  generalized 
Cay ley-Hami 1  ton  theorem  and  the  resulting  form  of  reachability  and 
observability  results,  has  not  been  previously  recognized.  Also,  the 
algorithm  just  described  provides  an  equivalent  wel 1-posedness  condition, 
namely  the  inver tibi 1 i ty  of  H  in  (6.9). 

6.2  A  Parallel  Outward- Inward  Solution 

A  second  efficient  algorithm  can  be  constructed  by  noting  that  the 
solution  x  can  be  recovered  from  the  outward  process  z^  and  the  inward  process 
z For  simplicity,  let  us  assume  that  N  is  odd  and  that  E  and  A  commute  (as 
they  would  if  (2.1),  (2.2)  is  in  normalized  form).  It  is  then  possible  to 
specify  a  recursive  algorithm  for  the  computation  of  zo(j,N-j)  for 

j  =  0 . (N-l)/2,  starting  from  the  initial  condition  at  the  center  of  the 

interval  (with  j  =  (N-l)/2): 

zq( ( N— 1 )/2,  (N+l )/2)  =  Bu((N-l)/2)  (6.11) 

and  propagating  symmetrically  outward  from  the  center: 

zo(j-l.N-j+l)  =  EAzo(j,N-j)  +  AN_2j+1Bu(j-l)  +  EN_2j+1Bu(N-j )  (6.12) 

Similarly  we  can  compute  z.(j.N-j)  recursively  inward  from  the  initial 


condition 


using  a  recursive  procedure  based  on  that  outlined  in  Section  III  (see 
(3.13)  -  (3.21)). 


The  solution  x  can  then  be  computed  as 


X(j) 

_AN-2j  EN-2j 

-1 

zo(j.N-j) 

x(N-j) 

W.(j.N-j)  Wf(j.N-j) 

z^j-N-j) 

(6.14) 


where  the  inverse  on  the  right-hand  side  of  (6.14)  is  guaranteed  to  exist 
thanks  to  the  wel 1-posedness  of  (2.1),  (2.2). 

6.3  A  Serial  Outward- Inward  Solution 

As  a  first  step  in  this  algorithm  we  compute  Zo(j,N-j)  outward  from  the 
interval  center  as  in  (6.11),  (6.12).  We  then  use  these  values,  together  with 
the  boundary  condition  v,  to  solve  for  x(j)  and  x(N-j)  recursively  as  we 
propogate  back  toward  the  interval  center.  To  begin,  note  that 


x(0) 

-A* 

en  ' 

-1 

zo(0,N) 

x(N) 

V1 

vf 

V 

where  the  inverse  indicated  on  the  right-hand  side  of  (6. 15)  is  again 
guaranteed  to  exist  thanks  to  wel 1-posedness .  To  continue  with  the  inward 
recursion,  note  that  from  (3.1) 


(6.16) 


In  this  paper  we  have  analysed  some  of  the  system-theoretic  properties  of 
TPBVDS’s.  As  in  Krener’s  analysis  of  continuous- time,  non-descriptor, 
boundary-value  systems,  there  are  actually  two  distinct  concepts  for 
reachability  and  for  observability  of  TPBVDS's,  and  in  this  paper  we  have 
investigated  each  of  these.  In  addition,  we  have  described  three  methods  for 
the  efficient  solution  of  TPBVDS’s,  one  based  on  a  variation  on  Kronecker's 
form  for  a  regular  pencil  and  two  on  the  inward/outward  recursions  and 
processes  that  play  such  an  important  role  in  the  analysis  of  these  systems. 
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An  important  step  in  our  analysis  is  the  introduction  of  a  standard  form 
for  regular  pencils.  This  form  permits  us  to  obtain  a  simple  form  for  a 
generalized  Cay ley-Hami 1  ton  theorem  which  in  turn  leads  to  simpler 
reachability  and  observability  results  than  have  appeared  previously  in  the 
literature.  It  is  worth  noting  that  this  generalized  Cay ley-Hami 1  ton  theorem 
and  the  resulting  reachability  and  observability  results  continue  to  hold  if  E 
and  A  take  the  form 


where  E.,  A.  are  in  standard  form,  i.e.,  a.E.  +  B. A.  =  I,  but  where  ( a,,B.) 

11  ii  i  l  v  1  K1 ' 

and  need  not  be  the  same.  An  example  of  such  a  form  is  the  variation 

of  Kronecker's  form  given  in  (6.1),  (6.2). 

There  are  a  variety  of  extensions  and  complements  to  the  results 
presented  in  this  paper.  Many  of  these  involve  the  examination  of  stationary 
TPBVDS's,  i.e.,  models  as  in  (6.1),  (6.2)  but  for  which  the  Green’s  function 
G(k,t)  in  (2.17)  depends  only  on  k-t.  As  we  will  describe  in  a  subsequent 
paper,  the  analysis  of  stationary  TPBVDS's  can  be  significantly  simplified  and 
extended.  For  example,  the  description  and  recursive  computation  of  the 
inward  process  z^(k,t)  is  far  simpler  in  the  stationary  case.  In  addition, 
for  such  systems  the  weak  reachability  and  unobservability  spaces  are 
time-invariant  (i.e.,  they  don’t  rotate).  As  a  simple  example,  consider  the 
class  of  cyclic  processes,  i.e.,  processes  for  which  =  -V^.  =  I  (so  that 
x(0)  =  x(N)  if  v  =  0).  Not  only  is  such  a  process  stationary  but  from  (5.16). 
(5.35)  we  see  that  in  this  case  weak  and  strong  reachability  and  observability 


concepts  coincide. 

There  are  also  a  number  of  other  system-theoretic  concepts  that  can  be 
developed  in  detail  for  a  stationary  TPBVDS’s.  For  example,  there  exists  a 
minimal  realization  theory  for  such  systems  analogous  to  that  described  by 
Krener.  In  addition,  it  is  possible  to  develop  a  concept  of  stability  for 
such  systems,  reflecting  the  effect  that  the  boundary  conditions  have  on  the 
process  near  the  center  of  the  interval  as  the  boundaries  recede.  Not  only  is 
such  a  concept  useful  in  determining  the  numerical  wel 1-posedness  of 
algorithms  such  as  those  described  in  Section  VI,  but  it  also  provides  the 
basis  for  analyzing  stochastic  TPBVDS's  and  the  properties  of  optimal 
estimators  for  such  processes  [22], 
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